Lecture 12: Binocular Stereo and Belief Propagation

نویسنده

  • A. L. Yuille
چکیده

Binocular Stereo is the process of estimating three-dimensional shape (stereo) from two eyes (binocular) or two cameras. Usually it is just called stereo. It requires knowing the camera parameters (e.g., focal length, direction of gaze), as discussed in earlier lecture, and solving the correspondence problem – matching pixels between the left and right images. After correspondence has been solved then depth can be estimated by trigonometry (earlier lecture). The depth is inversely proportional to the disparity, which is the relative displacement between corresponding points. This lecture will discuss how to solve the correspondence to estimate the disparity can be formulated in terms of a probabilistic markov model which assumes that the surfaces of surfaces are piecewise smooth (like the weak membrane model). The epipolar line constraint (see earlier lecture) means that points in each image can only match to points on a one-dimensional line in the second image (provided the camera parameters are known). This enables a simplified formulation of stereo in terms of a series of independent one-dimensional problems which can each be formulated as inference on a graphical model without closed loops. Hence inference can be performed by dynamic programming (see earlier lecture). A limitation of these one-dimensional models is that they assume that the disparity/depth of neighboring pixels is independent unless they lie on the same epipolar line. This is a very restrictive assumption and it is better to impose dependence (e.g., smoothness) across the epipolar lines. But this prevents the use of dynamic programming. This motivates the use of the belief propagation (BP) algorithm which perform similarly to dynamic programming on models defined over graph structures without closed loops (i.e. are guaranteed to converge to the optimal estimate), but which also work well in practice as approximate inference algorithms on graphs with closed loops. The intuitive reasoning is that the one-dimensional models (which do dynamic programming) perform well on real images, hence introducing cross epipolar line terms will only have limited effects, and so belief propagation is likely to converge to the correct result (e.g., doing DP on the one-dim models which put you in the domain of attraction of the BP algorithm for the full model). This lecture will also introduce the belief propagation (BP) algorithm. In addition (not covered in class) we will describe the TRW and related algorithms. Note that since stereo is formulated as a Markov Model we can use a range of other algorithms to perform inference (such as the algorithms described earlier). Similarly we can apply BP to other Markov Models in vision. (Note that BP applies to Markov Models with pairwise connections but this can be extended to generalized belief propagation GBP which applies to Markov Models with higher order connections). Finally we point out a limitation of many current stereo algorithms. As known by Leonardo da Vinci, depth discontinuities in shapes can cause some points to be visible to one eye only, see figure (1). This is called half-occlusion. It affects the correspondence problem because it means that points in one eye may not have matching points in the other eye. Current stereo algorithms are formulated to be robust to this problem. However, the presence of unmatched points can be used as a cue to determine depth discontinuities and hence can yield information (this was done for one-dimensional models of stereo – Geiger, Ladendorf, and Yuille. Belhumeur and Mumford – which also introduced DP to this problem – except there was earlier work by Cooper and maybe Kanade?).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Graph-Cut and Belief-Propagation Stereo on Real-World Image Sequences

This paper deals with stereo correspondence search, using graph cuts and belief propagation, for estimating depth maps. The results following different preprocessing steps are evaluated, using the quality of the disparity map. Running times are also investigated. For evaluation purposes, different kinds of images have been used: reference images from the Middlebury Stereo website, synthetic dri...

متن کامل

Belief Propagation Implementation using CUDA on XPS

Disparity map generation is a significant component of vision-based driver assistance systems. This report describes an efficient implementation of a belief propagation algorithm on GPUs that can be used to speed up stereo image processing. For evaluation purposes, different kinds of images have been used: reference images from the Middlebury stereo website, and real-world stereo sequences, sel...

متن کامل

Integrating Stereo with Shape-from-Shading derived Orientation Information

Binocular stereo has been extensively studied for extracting the shape of a scene. The challenge is in matching features between two images of a scene; this is the correspondence problem. Shape from shading (SfS) is another method of extracting shape. This models the interaction of light with the scene surface(s) for a single image. These two methods are very different; stereo uses surface feat...

متن کامل

Occlusion Detection Using Warping and Cross- Checking Constraints for Stereo Matching

In this paper, we propose an occlusion detection algorithm which estimates occluded pixels automatically. In order to detect the occlusion, we obtain an initial disparity map with an optimization algorithm based on the modified constant-space belief propagation (CSBP) which has low complexity. The initial disparity map gives us clues for occlusion detection. These clues are the warping constrai...

متن کامل

A LSS-based registration of stereo thermal-visible videos of multiple people using belief propagation

In this paper, we propose a novel stereo method for registering foreground objects in a pair of thermal and visible videos of close-range scenes. In our stereo matching, we use Local Self Similarity (LSS) as similarity metric between thermal and visible images. In order to accurately assign disparities to depth discontinuities and occluded Region Of Interest (ROI), we have integrated color and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012